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Abstract 

In Constraint Programming, global constraints allow to model and solve many combinatorial 
problems. Among these constraints, several sortedness constraints have been defined, for which 
propagation algorithms are available, but for which the tractability is not settled. We show 
that the sort(C/, V) constraint (Older et. al, 1995) is intractable (assuming Pt^NP) for integer 
variables whose domains are not limited to intervals. As a consequence, the similar result holds 
for the sort{U,V, P) constraint (Zhou, 1996). Moreover, the intractability holds even under 
the stability condition present in the recently introduced keysorting([/, V, Aeys, P) constraint 
(Carlsson et ah, 2014), and requiring that the order of the variables with the same value in the 
list U be preserved in the list V. Therefore, keysorting({7, V, Aeys, P) is intractable as well. 
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1 Introduction 

Constraint programming systems support an increasing number of global constraints, i.e. con¬ 
straints for which the number of variables is arbitrary. Such constraints define an important 
search space, that may be pruned using constraint propagation algorithms. Implementing a 
certain notion of consistency, a propagation algorithm removes infeasible values from the do¬ 
mains of the variables, and its efficiency is evaluated both with regard to its ability to limit the 
search space, and with regard to its running time. Dealing with global constraints in general, 
that is, without fixing a constraint (or a set of constraints), is intractable [T]. However, each 
constraint has its own complexity, which may range from tractability at all levels of consistency, 
as for the alldifferent constraint m, to intractability at relatively low levels of consistency, as 
for linear equations m- 

The tractability of a particular constraint is not always settled when the constraint is de¬ 
fined, and this is the case for the sortedness constraints sort(f7, P) [^, sort(f7, V, P) [TB] and 
keysorting(P, P, Aej/s, P) [2]. Although one or several of these constraints are implemented 
in well-known systems like SICStus Prolog [2|, Gecode m and Choco m, as well as in the 
constraint modelling language MiniZinc [S], their hardness is unknown. 

In this paper we show that the intuitively simplest of these sortedness constraints, namely 
sort([/, V), is intractable (unless P = NP) even in the case where the domains of the variables in 
U are disjoint, and this leads to the intractability of sort(C/, V, P) and of keysorting([/, V, Keys, P). 

The organisation of the paper is as follows. In Section we give the terminology and 
notations used in the paper. In Section]^ we transform, using ideas from [7], the search for a 
support of sort(f7, V) into a graph matching problem that we call SortSupport, and we show 
how to associate with each instance of the NP-complete problem Not-All-Equal 3SAT an 
instance of SortSupport. The next section is devoted to the proof that our construction is a 
polynomial transformation [3], implying the NP-completeness of SortSupport. In Section]^ 
we deduce hardness results about the three sortedness problems. Section]^ is the conclusion. 
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2 Constraints and consistency 

In this paper, we deal with constraints over integer domains. Given a variable w, we denote 
Dom{w) its domain, which is assumed to be a finite set of integers. When Dom{w) is written 
as an interval [L.r] (with integer l,r such that Z < r) or a union of intervals, we understand 
that it contains only the integers in the (union of) interval(s), i.e. [l-.r] is defined as [L.r] := 
{d G Z\l < d < r}. 

A constraint C is a couple (W,i?), where W = {wi,W 2 t ■ ■ ,wt} is a set of variables with 
associated domains Dom{wi), and i? is a t-ary relation over Z (equivalently, a subset of Z*). 
The constraint C = (W, R) is satisfied by a t-tuple <5 = { 81 , 62 , ■■■ ,St) assigning the value Si to 
variable Wi, 1 < i < t, ii 6 G R. Denote V := Dom{wi) x Dom{w 2 ) x ... x Dom{wt). If C is 
satisfied by a t-tuple 6 G V, then 5 is a support of C. 

A constraint satisfaction problem (or CSP) is defined as a set of variables with their asso¬ 
ciated domains, and a set of constraints defined on subsets of the variable set. A solution of a 
CSP is an assignement of values from the associated domains to the variables that satisfies all 
the constraints. In order to solve a CSP, constraints are successively used to prune the search 
space, with the help of propagation algorithms that often seek to enforce various consistency 
properties, defined below (following [3]). A domain V is said domain consistent for the con¬ 
straint C — {W,R) if, for each variable Wi, 1 <i <t, and for each value 5i G Dom{wi), there 
is a support of C assigning the value Si to Wi. Domain consistency is a strong requirement, 
for which the following variants of bounds consistency are progressively weaker, but often very 
useful, alternatives. 

Denote infx)(rci) and supp(r(;i) respectively the minimum and maximum value in Dom{wi). 
We say that a domain T) is houndsfp) consistent for C if for each variable Wi and for each value 
Si G {infx)(wi),supx)(u'i)} there exist integers Sj with Sj G Dom{wj), 1 < j < t and j i, 
such that (^1,(52, ■ ■ ■ satisfies C. We say that a domain T> is boundsfiL) consistent for C if 
for each variable Wi and for each value Si G {inf-p(?iii), supp(wi)} there exist integers Sj with 
inf-p(wj) < Sj < vaix>{u}j), 1 < j < t and j ^ i, such that (<5i,(52, ■ ■ ■ ,St) satisfies C. Finally, 
we say that a domain T) is & 0 Mn(is(IR) consistent for C if for each variable Wi and for each value 
Si G {inf'p(iCi), sup 2 )(wi)} there exist real numbers Sj with iiiix>{wj) < Sj < infx)(wj), 1 < j <t 
and j i, such that (i5i,(52, ■ ■ ■ ,St) satisfies C. 

We now define the sortedness constraints: 

• the sort([/, V) constraint, defined in j^, has variable set UUV, where U = {ui,U 2 ,..., m„} 
and V = {vi,V 2 , ■ ■ ■ ,Vn}, and is satisfied by a 2n-tuple of values assigned to the variables if 
and only if the variables in V are the sorted list of the variables in U. The correspondence 
between the variables in U and those in V is therefore a permutation. Propagation 
algorithms achieving bounds (Z)-consistency have been proposed in and |7]. 

• the sort(t7, V, P) constraint, defined in [T3], generalizes the sort(C/, V) constraint by adding 
a set P of n variables with domains included in {1, 2,..., n} in order to bring the per¬ 
mutation into the variable set of the constraint. This constraint thus has variable set 
UUVU P and is satisfied by a 3n-tuple of values if and only if (a) the variables in V are 
the sorted list of the variables in U, (b) the variables in P are all distinct, and (c) the 
permutation associating the variables from U and V is the one defined by the variables 
in P. The propagation algorithms for the sort(Z7, V) constraint are able to reduce the 
domains of the variables inUUV similarly, but achieve bound(Z)-consistency only on the 
P-domains, and not on the U and P-domains |14| . 

• the keysorting(P, V, Keys, P) constraint (where Keys is a positive integer), defined in (2], 
allows to add two features with respect to sort(P, P, P) : (a) each variable is a Zi-tuple 
{h>\ and integer, common to all variables), whose first Keys elements form the sorting 
key of the variable, using lexicographic ordering; and (b) the sorting has to be stable, i.e. 
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any pair of variables with the same key value must have the same order in U and in V . 
The domain Dom{z) of any variable z from U U V is thus a /i-tuple of domains. When 
Keys = 1, the lexicographic order of the keys is the classical order between integers, and 
thus keysorting(C7, y, 1, P) is similar to sort(t/, V, P), except that it requires the stability 
of the sorting. 

Given two non-empty sets of integers D and E, we write D <iex E whenever there exist 
values d G D and e G E such that d < e. This is not (and is not intended to be) an order on 
sets, but allows to compare the domains of the variables with respect to the possibility to have 
a given order between the assigned values. 

3 Links between sortedness, graph matching and 3SAT 

Consider two sets of variables U = {ui | 1 < * < n} and V = {vi 11 < z < n}, with finite 
integer domains Dom{ui) and Dom{vi), for all z, 1 < z < n. Following [7], we define the 
intersection graph T{U,V) of U and V as the bipartite graph with vertex set U UV and 
edge set {(ui,Vj) \ Dom{ui) H Dom{vj) ^ 0}. A matching of r(P, y) is an injective function 
a : V' C V —>■ U such that (a(v),v) is an edge of T for each v G V■ We also use the notation 
M = {(tT(z;), v),v G V'} to designate the same matching. A matching M saturates a vertex x if 
there exists an edge in M with endpoint x. We say that M is a perfect matching if it saturates 
all the vertices in r(P, y). 

Denote Qi = Dom{(j(vi)) H Dom{vi), for each Vi for which (T(ui) is defined. Then testing 
whether sort(P, y) has a support is equivalent to solving the following problem: 

SortSupport 

Instance: Two sets of variables U = {ui 1 1 < z < zz} and V = {vi 1 1 < z < n}, with finite 
integer domains Dom(ui) and Dom{vi), for all z, 1 < z < n. 

Question: Is there a perfect matching cr : y —>■ P of T{U, V) such that Qi for all z with 
1 ^ Z ^ zz and Q\ iflex Q 2 'Elex ■ • ■ “filex Qti^ 

We show that SortSupport is NP-complete, and this even in the case where the domains 
Dom{ui), 1 < z < zz, are disjoint. To this end, given r(P, y), a matching fulfilling the required 
conditions is called a sort-matching. Note that the order vi,V 2 , ■ ■ ■ ,Vn of the elements in V is 
important, since it defines the sort-matching. 

We adapt the graph construction in [10], and therefore use the same notations. The re¬ 
duction is from the NP-complete problem Not-All-Equal 3SAT jj], the variant of 3SAT in 
which each clause is required to have at least one true and at least one false literal. 

Let H — Hi A H 2 /\. ■. A Hk he an instance of Not-All-Equal 3S AT, where each clause Hi, 
i = 1,2,... ,k, contains three literals from the set xi, xi, X 2 ,X 2 , ■ ■ ■, Xp, xf. We assume that, 
for each j = 1,2,... ,p, the literal Xj occurs in the instance H as many times as xf (otherwise, 
add to H an appropriate number of clauses {xj V Xj Vtcf) or {xj VxjVtc])). We note occ{j) the 
total number of occurrences of Xj in a clause, either as a positive oeeurrence (i.e. as Xj) or as 
a negative oeeurrence {i.e. as xf). 

We wish to build an instance U, V of SortSupport such that r(P, y) consists of: 

• a unit graph G® (see Figure j^, for each positive or negative occurrence of a literal Xj in a 
clause Hi. The vertices of G® are the variables ait, bp c®. from U and {at)' , {ht)' , (c®)' from 
y joined by the four edges (a®, (6®)'), (6®, (c®)') (called up-edges), {bp (a®)') and (c®, (6®)') 
(called down-edges). Unit graphs are positive or negative depending on the occurrence of 
Xj they represent. 
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Figure 1: The unit graph G) for a positive or negative occurrence of Xj in Hi. Note that variables 
a^jjbjjCj belong to U and variables (a*)', (&j)^ {c^jY belong to V. 



(cp’ 


(bp’ 


(ap’ 


Figure 2: The consistency component CCj assuming that the literal Xj occurs in clauses Hs,Hr and 
literal SJ occurs in clauses Hi, Hg. The horizontal edges are respectively the up-linking edges and the 
down-linking edges, according to their position. 


• a consistency component CCj (see Figure]^ for each literal Xj, connecting all unit graphs 
associated with positive and negative occurrences of Xj. The unit graphs are arbitrarily 
ordered such that they correspond alternately to a positive and to a negative occurrence, 
the first unit graph being associated with a positive occurrence of Xj. Up-linking edges 
join the a® vertex of a negative unit graph to the {cjY vertex of the positive unit graph 
following it, in a circular way. Down-linking edges join the cj vertex of a positive unit 
graph to the (a’jY vertex of the negative unit graph following it. 

• a truth component Di (see Figure]^ for each clause Hi, connecting the three unit graphs 
associated with the literals in Hi. Note that the negative unit graphs are drawn with 
up-edges down, and vice versa. Four vertices di,ei (defined to be in U) and d', e' (defined 
to be in V) are added, the former ones joined to the {CjY vertex of every negative unit 
graph and to the {Cj)' vertex of every positive unit graph, whereas the latter ones are 
joined to the c* vertex of every negative unit graph and to the a* vertex of every positive 
unit graph. These edges are called lateral edges. 

• a completion component E providing, for each i with 1 < f < A:, an edge between and 
each e' such that i j. 

The truth components allow to locally give truth values to the literals, whereas the consis¬ 
tency components guarantee that the locally given truth values are globally correct, that is, each 
literal is either true or false but not both. The completion component ensures the existence of 
a perfect matching in the graph. Notice that, if a clause Hi contains two (or more) occurrences 
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Figure 3: The truth component Di assuming that the associated clause Hi is (alj V a;; V xj). The 
dashed lines outgoing from vertices belonging to unit graphs are the up- and down-linking edges inside 
the consistency component containing the vertex. The dashed lines outgoing from ei and e'i symbolize 
the set of edges in the completion component E. Note that the negative unit graphs are drawn with 
up-edges down, and vice versa. 


of the same literal Xj, then notations and G*’^ should be used to identify the unit graph 
associated with each occurrence. We do not enter into such details in our presentation, in order 
to keep it as simple as possible. 

To define the required order on the elements in V, we assume that in each unit graph the 
elements [a])', (^5)^ (cjfy are ranged in this order, and in each consistency component CCj unit 
graphs are ordered according to the alternate arbitrary order chosen to build CCj (recall that 
the first unit graph in CCj is positive). Then, the global order on V is built by considering the 
GGi, CC2, ..., CCp consistency components in this order, and by adding vertices d{, i 

e{, €2,, e'/, in this order at the end. 


Example. Let ff = H 1 AH 2 with Hi : xiVX 2 'dx 3 and H 2 : XiWx^VxH- Then the consistency 
component GGi contains the unit graphs Gf (positive) and G) (negative), in this order; CC 2 
contains (positive) and G| (negative), in this order; and GG 3 contains G 3 (positive) and G§ 
(negative), in this order. There are two truth components Di and D 2 corresponding respectively 
to Hi and ify. By definition, Di includes the unit graphs G),G 2 and G|, whereas D 2 contains 
the unit graphs Gf,G| and G|. The completion component has only two edges, ( 62 , 63 ) and 
( 61 , 62 ). The order of the variables in V is (af)', (bf)', (cf)', (a))', ( 6 })', (c})', (a^)', (b^)', (c^)', 

(aiY, (bir, (dr, («!)', (b^j, ( 4 )', (air, (biy, (dr, 4,4, ey e'2. 


To finish our construction, we have to define the domains of the vertices in U and in V that 
exactly define the sought intersection graph r(?7, V). To simplify the notations, the domain of 
a vertex is denoted similarly to the vertex, but with an upper case instead of a lower case, (e.g. 
Dom{a‘j) is denoted A^, except for the vertices di,d',ei, 6 '. 

Remark 1 Note that in the sequel we do not seek to minimize the sizes of the domains we 
define, as this is not important for the proof of NP-completeness. In particular, we avoid 
domains that are singletons, in order to allow a better illustration of the domains and their 
intersections in Figure ^ 
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Figure 4: Domains of the variables in G] and Di, for fixed i and j, and their overlaps with other 
domains. The clause Hi is assumed to contain occurrences of literals Xj,Xi,Xf. The notations and 
{AjY refer respectively to the domains of the variables aj and (oj)', from the unit graphs Gj and GJ 
that precede and respectively follow G] in the circular order of GGj. The main domains of the variables 
are drawn with plain lines. Their secondary domains are drawn with dashed lines for a positive Gj, 
and with dotted lines for a negative Gj (and this convention is extended to G® and Gj). We assumed 
that we are not in the third case of the definition of Jfj. 


Each unit graph (see Figure]^ is defined by domains included into an interval of Z, of t = 24 
consecutive integers (that we call a block), so that each consistency component CCj is defined 
on f * occ{j) consecutive integers, and all the consistency components are represented on the 
interval [l..t * TFj^^occ^j)]. Noticing that YFj^-yOcc{j) = 3fc, where k is the number of clauses, 
and defining m = 3kt, we define first the domains for the variables d), 1 < i < k, used in the 
truth components, as follows: 

Dom{d'i) = [m + 2i — l..m + 2i] 

This interval is devoted to creating the lateral edges in D^. Note that the last integer used by 
an interval in Dom(d'i^) is to + 2k, that we denote q. For each i, we define: 

Dom{e'i) = [q + 2i — l..q + 2i]. 

This interval is dedicated both to the lateral edges in Di and to the edges in E. 

Now, each unit graph G® is defined by domains inside the block [ht + l..{h + l)t] (with 
little exceptions), where h is the number of unit graphs before G* in the global order, i.e. 
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h = Y,q^jOcc{q) + r — 1, where r is the position of G* in the alternate order of CCj. The 
domains of a*,6*,c*, (a})^ (^})^ (cj)' are respectively defined as (see Figure |^: 


A] = [ht + 3..ht + 6] U X] 

= [ht + 9..ht + 12] 

C] = [ht +15../it +18] u r; 

(^*)' = [/it + 7../it + io]uy/ 

(i?*)' = [/it + 5../it + 8] U [/it + 13../it + 16] 
(qy = [/it + ii../it + i4]uq 


where 



Dom{d[) U Dom{e'^), 

[{h + l)t + 21..(/i + l)t + 22)], 

[(/i + 1 — occ(j))t + 21.. 

..(/i + 1 — occ{j))t + 22], 


if Gj is a positive unit graph 

if G* is a negative unit graph and, moreover, 

Gj is not the last unit graph in GGj 

otherwise 


r 0 

y Dom{d[) U Dom{e[), 


if G'j is a positive unit graph 
if Gj is a negative unit graph 


f [ht -f l../it 2], 
y [/it — 7../it — 4], 


if Gj is a positive unit graph 
if Gj is a negative unit graph 


r [/it + 21../it + 22], 
( [ht 23..ht 24], 


if G'j is a positive unit graph 
if G'j is a negative unit graph 


The part of each domain defined by Xj,Tj,Yj or Zj before is called the secondary domain 
of respectively a'j , c'j ^ {a'j)\ {c'j)' . The remaining part of the domain, i.e. the one explicitly 
stated in the definitions of A'j, B'j, Gj, {A'j)', [B'j)', (Gj)' above, is called the main domain of the 
corresponding variables. Intuitively, the main domains allow to build the unit graphs and the 
down-linking edges of the consistency components. For a unit graph GJ, the main domains of 
its variables belong to its block [ht + l..{h + l)t]. The secondary domains are devoted to the 
connections with other unit graphs or with the other vertices of the truth components. 

It remains to give the domains of di and for each i. Recalling that a unit graph GJ exists 
if and only if Xj has a positive or egative occurrence in Hi, we define: 

Dom{di) = Ug,i jg positive^ '^Gjis negative 

Dom{ei) = Ugi jg positive^ is negative^j ^siiiDom{ei) 

Then, Dom{di) contains intervals from the domains of the R-variables in Di, whereas Dom{ei) 
contains in addition the domains of all ej with s ^ i. 


The construction before, obviously polynomial, yields a bipartite graph with n = Ilk vertices 
in each part of the bipartition. Among these vertices 3k * 6 (3k * 3 in each part) are in some 
consistency component and 4/c (2k in each part) are in some truth component (but not in a 
consistency component). We show now that the intersection graph of the variables we defined 
is indeed the graph we wished to build. 
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Claim 1 The edges built using the domains defined above for the variables in U and V are 
exactly those of the unit graphs G® , consistency components CCj, truth components Di and 
completion component E, for all i and j such that Xj has a positive or negative occurrence in 

A. 


Proof. We consider each variable and the edges it belongs to. It is easy to notice that the 
main domain of a variable among a®, 6 ®, (a®)', or (c®)' can have non-empty intersections 
only with the main domains of the same set of variables, The only exception to this rule is c® 
(see below and Figure |^. Moreover, the secondary domains of aJjC®, (a®)', (c®)' are specially 
defined to guarantee intersections with domains of other variables (according to the desired 
edges in CCj and Di), and thus have only empty intersections with the other domains in the 
block of G®. 

Variable a®. The main domain of a® has non-empty intersection only with {B))' resulting 
into the edge (a®, {b))'). The secondary domain of a® contains the domains of d) and e®, if G® is 
a positive unit graph, meaning that A) VDom{d'i) ^ 0 and A® nDom(e') ^ 0 and thus that the 
edges (apd'i), (a®, e') exist. It is easy to see that no other edge with endpoint a® exists in this 
case. If G® is a negative unit graph, then we have two sub-cases. If it is not the last unit graph 
in CCj, then X® = [(/i-|-l)t-|-21..(/i-|-l)t-|-22] and it has non-empty intersection only with the 
secondary domain Zf = [{h + l)t -I- 21..{h + l)t + 22] of the variable (cj)' in the positive unit 
graph Cj immediately following it in CCj. Then we have the up-linking edge between G® and 
Cj. If G® is the last unit graph in CCj, then X® = [(/i -|- 1 — occ(j))t-|-21..(/i-|-1 — occ(j))t-I-22] 
and X® is included in the block of the first positive unit graph of CCj, denoted G®®. Then X® 
has non-empty intersection only with the interval Z®® in the domain of the variable (c*®)® of the 
first unit graph, and thus we have again the up-linking edge between the two unit graphs. 

Variable 6 ®. Obviously, has non-empty intersections with the main domains of {A))' 
and (G®)®, defining the edges {bj,{ajy) and (&®, (c®)®) of the unit graph. No other non-empty 
intersections exist with B). 

Variable Cj. Again, we have an obvious intersection of G® with {Bj)', implying the edge 
{dj,{bty), and no other intersection with main domains. However, the main domain [ht -\- 

15.. /it -I- 18] of c® also has non-empty intersection with the secondary domain Yf = [(/i -|- 1)/ — 

7.. {h + l)t — 4] of the variable (aj)® in the unit graph following the current one, if this latter 
graph is negative. We are therefore in the case where the two unit graphs are joined by the 
down-linking edge (c®, (aj))®. Analyzing now the intersections of the secondary domain of c®, 
we have again two sub-cases. In the case G® is negative, then its intersections with Zlom(d') 
and Dom{e'i) are non-empty and we have the desired lateral edges in the truth component for 
the clause Hi. In the case G® is positive, there is no secondary domain for c®. 

Variable {a^j)'■ On the main domain, (A®)® has non-empty intersection with H®, yielding the 
edge (&®, (a®)®), and also with {B))', yielding no edge since both variables are in V. Consider now 
the secondary domain of (a®)'. When G® is positive, Yf is included into Dom{di) and Dom{ei) 
and we have the sought lateral edges in Di. When G® is negative, we have Yf = [ht — 7..ht — A] 
which is the same as [{h — l)t + 17..{h — l)t + 20], and thus has non-empty intersection with 
the main domain of the vertex c® in the unit graph immediately preceding G® in CCj. This 
confirms the down-linking edge already found for the variable c®. 

Variable (bj)'. Non-empty intersections with domains of variables from U are found only 
for variables a® and c), confirming the edges already found above. 

Variable (c®)®. The edge {bj, {c))') is confirmed by the non-empty intersection of (G®)® with 
H®. No other intersection is found with the main domain of {Cj)'. For its second domain, 
we have the intersection with Dom{di) and Dom{ei) yielding the corresponding lateral edges, 
in the case where G® is negative. In the contrary case, Zj = [ht -I- 21..ht + 22] and the only 


possible intersection is with some defined for another unit graph G®, which must be negative 
according to the definition of Xj. With X^ = (ft,'+l)t+21, (h'+l)t+22], we must have h' = h—1, 
meaning that Gj is the negative unit graph immediately preceding Gj in CCj, and thus we 
have the up-linking edge (a®, (c*)'). With X® = [{h' -|- 1 — occ{j))t + 21..{h' -I- 1 — occ{j))t + 22], 
we have that X® is the secondary domain of the a® vertex in the last unit graph G® in CCj. 
We also have that Xj is included in the block of the first unit graph G*^ in CCj. Thus Xj has 
non-empty intersection with Zj if and only if Zj also concerns G*\ that is if (cj)' = The 

resulting edge is then (a®, which is the up-linking edge closing the circuit of unit graphs 

in CCj. 

Variable di. By definition, Dom{di) is made of the three secondary domains of variables of 
type {djY and (c*)', according to the positive or negative occurrence of Xj in Di, for which the 
edges have also been confirmed above. We notice that all the other intersections with domains 
of variables from V are empty. 

Variable d'. By definition, Dom{d'i) has non-empty intersection only with Xj (when Gj is 
positive) and with Tj (when Gj is negative) and this yields the expected lateral edges. 

Variable e^. Again by definition, Dom{ei) is, as Dom(di), made of three secondary domains 
of variables of type (aj)' and (cj)' allowing to build the expected lateral edges, but also of A: — 1 
domains of variables ej allowing to build the edges from E incident with e^. 

Variable e(. As all the edges and non-edges with vertices from U have already been verified, 
there is nothing more to check here. 

All the edges in the unit graphs, consistency components, truth components and completion 
component are correctly built, and no undesirable edge is added. The claim is proved. I 

4 The proposed construction is a polynomial transforma¬ 
tion 

In this section, we show that there is a truth assignment satisfying H with at least one true and 
one false literal in each clause if and only if r([/, F) has a sort-matching. To this end, we first 
prove that: 

Claim 2 Let C^j be an arbitrary unit graph. Then no sort-matching M can contain simultane¬ 
ously the edges {apYbj)') and (&j,(aj)'), nor simultaneously the edges (cpibjY) and (bj,(cjY). 


Proof. We have that Aj n (BjY = [ht -\- 5..ht -\- 6 ], whereas Bj n (Aj)' = [ht -I- 9..ht -I- 10]. 
Since (aj)' precedes (6j)' in the order on V, if a sort-matching contained both edges (aj, (6j)') 
and {bj, (aj)') then we should have Bj n (Aj)' < 1 ^^, Aj n (Bj)', and this is obviously false. 

Similarly, Gjn(Bj)' = [Alt-f 15../it -I- 16], whereas Bjn(Gj)' = [/it-|-11../it-|-12], which does 
not satisfy Gj n (Bj)' Bj n (Gj)'. I 

Claim 3 Let CCj be an arbitrary consistency component. Then any sort-matching M satisfies 
the following property: 

a) either all the up-edges and all the down-linking edges in CCj belong to M; 

b) or all the down-edges and all the up-linking edges in CCj belong to M. 

Proof. Given a unit graph Gj from CCj, M must contain exactly one edge among (aj, (/>j)') 
and (cj, {bjY)t so that (6j)' is saturated, and similarly exactly one edge among {bj, {a^Y) and 
{bY{CjY)- By Claim 1^ it results that M contains either {a'‘j,{b'‘jY) and {bj,{CjY) (that is, the 
up-edges in Gj) or (cj, (5j)') and {bj, (aj)') (that is, the down-edges in Gj). 
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In the former case, c® (if G® is positive) or {a’j)' (if G® is negative) can only be saturated by 
the down-linking edge with one endpoint in G® (see Figure [^, implying that the next (previous, 
respectively) unit graph Gj in CCj, in a circular way, also has its up-edges in M. The same 
deduction may be done for the unit graph G®’ following (respectively preceding) G®, as follows. 
Since the up-linking edge is not used by M (the up-edges are already in M), the vertex of G® 
incident with this edge must be saturated locally, and this can only be done by an up-edge of 
G®. Thus, the property of a unit graph to have its up-edges and its incident down-linking edge 
in M is propagated to all the consistency component CCj. 

The reasoning is similar in the latter case. I 

Now we are ready to prove the main result. For a truth component Di and a unit graph 
G® in it, we call dCclose the up-edges of G® if G® is positive, and the down-edges of G® if Cj is 
negative. The other edges in G® are called d^-close. In other words, given that in the definition 
of Di the negative unit graphs are drawn upside down (see Figure |^, the d'-close edges of G® 
are the pairs of up- or down-edges one of whose endpoints is joined to d® (and similarly for di). 

Claim 4 There is a truth assignment satisfying H with at least one true and one false literal 
in each clause if and only ifT{U,V) admits a sort-matching. 

Proof. To prove the Only i/part, assume that Xi,X2, ... ,Xp have been assigned boolean 
values satisfying H as required. Build M, initially empty, as follows. For each clause Hi, assume 
it contains the literals Xj,xi,Xf, with either a positive or a negative occurrence each. Assume 
without loss of generality, that the (positive or negative) occurrence of Xj is true and that the 
(positive or negative) occurrence of xi is false. 

Then add to M: the edges of C)j close to d'; the edges of C] close to di ; the edges of Cy close 
to d'i or respectively to di depending whether the occurrence oi Xf is true or respectively false; 
the lateral edge joining d[ to G\ and the lateral edge joining di to G®. Moreover, add to M the 
up- or down-linking edges that are needed to saturate three of the four remaining unsaturated 
vertices of G®, G\ and Cy. The remaining unsaturated vertex belonging to a unit graph of Di 
is a vertex y of Cy. If the occurrence of x/ in Hi is false, then this vertex is either ay (when 
the occurrence is positive) or cy (when the occurrence is negative), and is always adjacent to 
e®. If, on the contrary, the occurrence of xy in Hi is true, then this vertex is either (a),)® (when 
the occurrence is positive) or (cj)® (when the occurrence is negative), and is always adjacent 
to a. Then add (y,e') to M if (y,e') is an edge, and add (ei,y) to M in the contrary case, 
leaving thus Ci unsaturated when the occurrence of x/ is false in Hi, and e® unsaturated when 
the occurrence of x/ is true in Hi. Equivalently, Ci (respectively e') remains unsaturated when 
Hi is oversupplied of (respectively true) literals. 

Now, as each consistency component has the same number of positive and negative occur¬ 
rences of its corresponding literal, it results that there are 3fc/2 true literals and 3k/2 false 
literals in H. Therefore, the number of clauses that are oversupplied of true literals is the same 
as the number of clauses that are oversupplied of positive literals, namely k/2 clauses in each 
case. Consequently, in the completion component E the unsaturated vertices induce & k 12- 
regular bipartite graph. By Hall’s theorem [B], this graph has a perfect matching M', that we 
add to M. The construction of the matching is now complete. 

We first have to show that M is correctly built. The construction implies that in every unit 
graph either both up-edges or both down-edges are in M. Inside any consistency component, 
all unit graphs are in the same case among these two cases. To see this, let G)- and G^- be two 
neighboring unit graphs and assume without loss of generality that G® is positive and Gj is 
negative, and also that Xj is true, implying that xf is false (the other cases are similar). Then 
in Di the up-edges of G® are put into M since they are d'-close, whereas in Di the up-edges 
of cy are put into M since they are di-close. Thus in CCj either all the up-edges or all the 
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down-edges are in M, together with the down-, respectively up-, linking edges by the definition 
of M. Moreover, in each Di and in E the matching is correctly built by definition. Thus M is 
correctly built. 

Obviously, M is a perfect matching. Recall that we associate with it a bijective function 
a : V ^ U such that cr(r;) = rt if and only if (u, v) € M. In order to show the inclusion property 
between sets Q{v) := Dom{a{v)) H Dom{v) required by a sort-matching, we show that for each 
pair v,w €V such that v precedes w in the order on V we have Q{v) <iex Q{w). 

This deduction is based on the following seven affirmations: 

Al. for each unit graph G), QUa])') <iex QUb])') <iex QiiCjY)- 

Indeed, as shown above, we have two cases. In the case where the up-edges of G* belong 
to M, we have that tT((&®)') = a®, cr((c®)') = 6®. Moreover, (T((a®)') e {di,ei} if G® is positive, 
and cr((a®)') = c® where G® precedes G® in GGj, if G® is negative. Then, (5((a®)') = Y- = 
\ht + l..ht + 2] and respectively Q{{a'^j)') = [{h — l)t - 1 - 17..(/i — l)t + 18] (where h defines the 
block of G®). See Figure As QUb^Y) = [ht + 5..ht + 6 ] and (5((c®)') = [ht + 11..ht + 12], 
the affirmation follows. In the case where the down-edges of Gj belong to M, we have that 
^iibjY) = dj, cr((a®)') = 6®. Moreover, cr((c®)') = a® where G® (circularly) precedes G® in CCj, 
if G® is positive, and cr((c®)') G {di,ei}, if G® is negative. We deduce that (5((c®)') = Z® = 
[ht+21..ht+22] and respectively Q((cp') = Zj = [ht+23..ht+24:]. As Q((a®)') = [ht+9..ht+10] 
and Q{{bjY) = [/it -I- 15../if -I- 16], the affirmation follows. 

A2. for each pair of consecutive unit graphs G®,GJ (in this order) in the same consistency 
component CCj, Q((c®)') <iex Q((ap'). 

As before, consider first the case where the up-edges of G® belong to M and deduce in the 
same way that (3((c®)') = [/it-1-11../it-1-12] (where h defines the block of G®-). Then, the up-edges 
of GJ also belong to M, therefore Q{{a^Y) [(^ + 1)^ + l..{h + l)t + 2], if GJ is positive, 

and respectively Q{{ajY) = [ht -I- 17..ht + 18], if G^ is negative. The affirmation is then proved. 
Consider now the case where the down-edges of G® belong to M and deduce as before that 
Q{{cjY) = [ht + 21..ht + 22] and respectively Q((cj)') = [ht + 23..ht + 24] when G® is positive, 
respectively negative. We also have in this case that (5((ap') = [{h + l)t + 9..{h -I- l)t -|- 10], 
proving the affirmation. 

A3, for each pair of consecutive unit graphs G®,GJ_|_]^ (in this order) in two consecutive 
consistency components, Q{{CjY) ^lex Q{{a^j+iY)- 

Notice that in this case G® is negative and Gj_^_l is positive, therefore as before we have 
either (5((c®)') = [ht + 11. .ht + 12] or Q((c®)') = [ht + 23. .ht + 24], depending whether the up- 
or the down-edges of G® belong to M. Similarly, we also have )')=¥[ = [{h + l)t + 

l..(/i -I- l)t -I- 2] or Q{{aj_^iY) = [{h + l)t -I- 9..(/i -|- l)t -f 10]. In all four possible cases, the 
affirmation holds. 

A4-. for the last unit graph G), in the last consistency component CCp, we have Q{{CpY) ^lex 
Q(rf'i). 

With the same deductions as above and since Gp is negative, we have that either Q{Cp) = 
[ht + 11..ht + 12] or Q{Cp) = [ht + 23..ht + 24], where h defines the block of Gp, i.e. ht+ 2-1 = 
m = 3kt (with t = 24) since we have 3k literals in the k clauses and thus 3k blocks, of which 
Gp uses the last one. Now, Q{d'i) = [m + l..m + 2] since Dom{d'i) = [m + l..m + 2] and it is 
included in the domains of the three variables yielding vertices adjacent to d'l in Di, and the 
affirmation follows. 

A5. for each i with 1 < i < /c, we have Q{d'Y) <iex Q{d[_^_i). 
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This is obvious by the definition of the domains and the observation that Q{d'g) = Dom{d'^) 
for all s with 1 < s < A:. 

A6. we have Q{d'f^) <iex 

Again, Q[d'f.) = Dom{d'j.) = [m + 2k — l..m + 2k] and similarly Q{e[) = Dom{e[) = 
[q + l..q + 2] where q = m + 2k. The affirmation is proved. 

A7. for each i with 1 < i < /c, we have Q(e9 <iex Q{e'^^i). 

This is obvious by the definition of the domains and the observation that Qie^) = Dom{e'g) 
for all s with 1 < s < A:. 

Affirmations A1-A7 allow to deduce that M is a sort-matching. 

We consider now the If part of the theorem. Assume therefore that a sort-matching exists 
in r(t/, y). By Claim such a matching must contain, for each unit graph, either both its 
low-edges or both its up-edges. Given that the matching is perfect, we deduce that all vertices 
di and d[ are saturated, and therefore that in each Di there is a unit graph Gj whose d'-close 
edges are in M (the unit graph containing the vertex y such that {di, y) € M) and a unit graph 
G\ whose di-close edges are in M (the unit graph containing the vertex z such that (z, d') S M). 
Call the third unit graph in Di. 

Define, locally to Di, the (positive or negative) occurrence of Xj in Di to be true, that of 
xi to be false, and that of x/ to be true if M contains the edges of G'^j, close to d', respectively 
false if M contains the edges of G^ close to di. Deduce a local truth assignment for each literal 
Xj,xi and Xf. Notice that: 

A8. The local truth assignment of the literal Xr, for each r G {j, I, f}, is true if and only if 
the up-edges of G), belong to M. 

Indeed, we have that Xr is true if and only if exactly one of the two following cases occurs: 
either Xr has a positive occurrence in Di, in which case this occurrence has been assigned the 
true value, so the d'-close edges of G\. belong to M; or Xr has a negative occurrence in Di, in 
which case Xr has been assigned a true value if and only if its negative occurrence has been 
assigned a false value, and this happens only if M contains the d^-dose edges of G*. In the 
former case, the d'-close edges are the up-edges of G*. In the latter case, due to the upside-down 
position of negative unit graphs in Di, the d^-close edges are also the up-edges of G*. And we 
are done. 

Due to the affirmato A8 and to Claimwe deduce that all the local truth assignments are 
coherent. Moreover, each clause Hi has at least one true literal and one false literal, namely 
the occurrences of Xj and of xi (with the notations above). I 

5 Hardness of sortedness constraints 

The construction in the preceding section allows us to deduce the NP-completeness of SORT- 
SUPPORT in general, but also in a particular case of it, as follows. 

Claim 5 SortSupport is NP-eomplete. 

Proof. Obviously SortSupport belongs to NP. To show it is NP-complete, apply Claim|^ 


With a slight modification of the domains we defined for the variables, we also have that: 

Claim 6 SortSupport is NP-eomplete, even in the case where the domains Dom{ui), 1 < 
i < n, are pairwise disjoint. 
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Proof. In order to have pairwise disjoint domains, the idea is: (a) to extend the intervals of 
Z that are used by several sets Dom{u) with u G U (thus shifting to right all the other intervals 
so as to avoid unwished overlaps); (b) to cut them into a sufficiently large number of sub¬ 
intervals; and (c) to use a specific sub-interval for each Dom{u), thus insuring the disjointness 
without modifying the relative positions on the real line of the intervals defining the sets Q{v). 

The shared intervals are as follows: Yj = [ht+ l..ht + 2] (shared by Dom{di) and Dom{ei), 
when G* is positive), Zj = [ht + 23..ht + 24] (shared by Dom{di) and Dom{ei), when G* is 
negative), Dom{d'^ = [m + 2i — l..m + 2i] (shared by Xj when G* is positive, and by Tj when 
G* is negative) and Dom{e[) = [q + 2i — l..q + 2i] (shared by Xj when Gj is positive, by Tj 
when G] is negative and by Dom{es) with s ^ i). As an example, consider Yj in the case of 
a positive unit graph G*. As Yj is shared by Dom{di) and Dom{ei), it should be extended 
to an interval of length three {e.g. [ht + l..ht + 4] instead of [ht + l..ht + 2] as it is now), in 
which case Dom{di) and Dom{ei) would be affected a sub-interval each {e.g. [ht + l..ht + 2] 
and [ht + 3..ht + 4] respectively). Of course, in this case, the domain Aj of aj should start at 
ht + 5 instead of ht + 3 (and similarly for the other domains), so as to avoid overlaps. This 
would result into an augmentation of the size t of each block. 

The proof of the correctness is very similar to the one above. The main difference is that 
some intersections between domains are slightly shifted. I 

Theorem 1 Testing whether soit{U,V), soit{U,V, P) or keysoit\ng{U,V, Keys, P) has a sup¬ 
port is NP-complete, even in the case where the variables in U have pairwise disjoint domains. 


Proof. For sort(G, V), the affirmation follows immediately by the equivalence to SortSup- 
PORT noticed in Section|^ and by Claimj^ Furthermore, sort(f7, V) is the variant of sort(f7, V, P) 
where each variable in P has the domain {1,2,..., n}. As testing whether sort(f7, V, P) has a 
support is obviously in NP, the previous remark allows to deduce the NP-completeness of the 
problem. Finally, soit{U,V, P) and keysorting(f7, V, 1, P) are equivalent when the domains of 
the variables in U are pairwise disjoint, since the stability of the sorting is trivially satisfied by 
any assignment of values to the variables. I 

Given a constraint C defined as in Section enforcing domain consistency requires to test 
whether for a given variable Wi and a given value Si G Dom{wi), a support of C exists assigning 
the value Si to Wi. We can easily deduce that: 

Corollary 1 Enforcing domain consistency for each of the constraints sort{U,V), sort{U,V, P) 
and keysorting(t7, V, Keys, P) is intractable, even in the case where the variables in U have 
pairwise disjoint domains. 

Proof. By contradiction and for each of the three constraints, assume a polynomial algo¬ 
rithm A exists for testing the existence of a support with a given value for a given variable. 
Recall that, by Claim an instance H of Not- All-Equal 3SAT is satisfiable if and only 
if T{U,V) admits a sort-matching, and this latter affirmation holds if and only if sort(17, E) 
has a support. By applying A to all the four values in Bj for an arbitrarily chosen variable 
bj, we test in polynomial time whether sort([/, E) has a support. Then we have a polynomial 
algorithm for solving Not- All-Equal 3SAT, a contradiction. The results for sort(17, E, P) 
and keysorting(P, E, iFeys, P) easily follow. I 

Focusing now on enforcing bounds consistency, we need to test whether for a given vari¬ 
able Wi and a given value Si G {inf-p(r(;i),supp(r(;i)}, a t-tuple {Si,S2,... ,St) satisfying C 
exists whose values Sj, j i are more or less constrained. More precisely, Sj must belong 
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to Dom(wj), respectively to [inf'p(ryj).. supx)(z(;j)], and respectively to [infx)(wi), supp(z(;i)] to 
allow bounds(D), respectively bounds(Z) and respectively bounds(IR) consistency. We are able 
to show that: 

Theorem 2 Enforcing houndsfp) consistency for each of the constraints sort(J7, V), sort(C7, V, P) 
and keysoit\ng{U,V, Keys, P) is NP-complete. 

Proof. It is easy to notice that these problems are in NP. We show that the reduction 
from Not-All-Equal 3SAT to SortSupport in Section allows to deduce the result for 
sort([/, E). Then the other results follow. 

In the instance of SortSupport built in Section]^ the variable d[ has the domain Dom{d'fj = 
[m + 2i— l..m + 2i], so that mfjy^d'^) = m + 2i — 1 and svcpjj{d'^) = m + 2i. Let y = m + 2i — l 
and let us notice that r(t/, E) has a sort-matching if and only if it has a sort-matching such 
that y G Q{d'f). This is an easy consequence of the observation that Dom{d'f) has non-empty 
intersection with another domain if and only if it is included in it, i.e. if and only if y belongs 
to the intersection. Then by Claim we deduce the NP-completeness of testing whether there 
is a support of sort([/, E) assigning to d' the value y. I 

Notice that the previous result is not proved for pairwise disjoint domains of variables in 
U. The reason is that in this variant the domain of d' strictly overlaps the domains of other 
variables and the proof of Theorem]^ is no longer valid. 

6 Conclusion 

In this paper we have shown that the three sortedness constraints defined up to now are in¬ 
tractable, even in the particular case where the variables to sort have pairwise disjoint domains, 
and even if we do not seek domain consistency but only enforcing bounds (D) consistency. The 
tractability of the lower levels of bounds consistency, i.e. bounds(Z) and bounds(IR) consistency, 
is shown for sort([/, E) 17], but is still open for sort(C7, E, P) and keysorting(f7, E, Keys, P). 
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